Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 21 |
| Number of observations | 13580 |
| Missing cells | 13256 |
| Missing cells (%) | 4.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.2 MiB |
| Average record size in memory | 168.0 B |
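The percentage and size figures in this table follow from the raw counts. As a quick sanity check, the sketch below recomputes them, assuming that the missing-cell percentage is taken over all cells (variables × observations) and that the total size is the average record size times the number of observations. The variable names are illustrative only.

```python
# Values copied from the dataset statistics table.
n_variables = 21
n_observations = 13580
missing_cells = 13256
avg_record_bytes = 168.0

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = missing_cells / total_cells

# Assumption: total size = average record size x number of records.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"Missing cells: {missing_pct:.1%}")
print(f"Total size:    {total_mib:.1f} MiB")
```

Both results round to the reported values (4.6% and 2.2 MiB).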
Variable types
| Type | Count |
|---|---|
| NUM | 13 |
| CAT | 8 |
Warnings
| Message | Type |
|---|---|
| Suburb has a high cardinality: 314 distinct values | High cardinality |
| Address has a high cardinality: 13378 distinct values | High cardinality |
| SellerG has a high cardinality: 268 distinct values | High cardinality |
| Date has a high cardinality: 58 distinct values | High cardinality |
| Bedroom2 is highly correlated with Rooms | High correlation |
| Rooms is highly correlated with Bedroom2 | High correlation |
| BuildingArea has 6450 (47.5%) missing values | Missing |
| YearBuilt has 5375 (39.6%) missing values | Missing |
| CouncilArea has 1369 (10.1%) missing values | Missing |
| Landsize is highly skewed (γ1 = 95.23740045) | Skewed |
| BuildingArea is highly skewed (γ1 = 77.69154092) | Skewed |
| Address is uniformly distributed | Uniform |
| Car has 1026 (7.6%) zeros | Zeros |
| Landsize has 1939 (14.3%) zeros | Zeros |
Reproduction
| Field | Value |
|---|---|
| Analysis started | 2020-10-08 17:28:56.101470 |
| Analysis finished | 2020-10-08 17:29:29.010838 |
| Duration | 32.91 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
Suburb
Categorical
| Statistic | Value |
|---|---|
| Distinct | 314 |
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Reservoir | 359 | 2.6% | |
| Richmond | 260 | 1.9% | |
| Bentleigh East | 249 | 1.8% | |
| Preston | 239 | 1.8% | |
| Brunswick | 222 | 1.6% | |
| Essendon | 220 | 1.6% | |
| South Yarra | 202 | 1.5% | |
| Glen Iris | 195 | 1.4% | |
| Hawthorn | 191 | 1.4% | |
| Coburg | 190 | 1.4% | |
| Other values (304) | 11253 | 82.9% |
Unique
| Statistic | Value |
|---|---|
| Unique | 21 |
| Unique (%) | 0.2% |
Length
| Statistic | Value |
|---|---|
| Max length | 18 |
| Median length | 9 |
| Mean length | 9.79646539 |
| Min length | 3 |
Address
Categorical
| Statistic | Value |
|---|---|
| Distinct | 13378 |
| Distinct (%) | 98.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 36 Aberfeldie St | 3 | < 0.1% | |
| 28 Blair St | 3 | < 0.1% | |
| 53 William St | 3 | < 0.1% | |
| 2 Bruce St | 3 | < 0.1% | |
| 5 Charles St | 3 | < 0.1% | |
| 1/1 Clarendon St | 3 | < 0.1% | |
| 13 Robinson St | 3 | < 0.1% | |
| 5 Margaret St | 3 | < 0.1% | |
| 14 Arthur St | 3 | < 0.1% | |
| 2/13 Walker St | 2 | < 0.1% | |
| Other values (13368) | 13551 | 99.8% |
Unique
| Statistic | Value |
|---|---|
| Unique | 13185 |
| Unique (%) | 97.1% |
Length
| Statistic | Value |
|---|---|
| Max length | 27 |
| Median length | 13 |
| Mean length | 13.51045655 |
| Min length | 8 |
Rooms
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 9 |
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.937997054 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 1 |
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.9557479384 |
| Coefficient of variation (CV) | 0.3253059553 |
| Kurtosis | 0.7940679895 |
| Mean | 2.937997054 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3764780328 |
| Sum | 39898 |
| Variance | 0.9134541218 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 5881 | 43.3% | |
| 2 | 3648 | 26.9% | |
| 4 | 2688 | 19.8% | |
| 1 | 681 | 5.0% | |
| 5 | 596 | 4.4% | |
| 6 | 67 | 0.5% | |
| 7 | 10 | 0.1% | |
| 8 | 8 | 0.1% | |
| 10 | 1 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 681 | 5.0% | |
| 2 | 3648 | 26.9% | |
| 3 | 5881 | 43.3% | |
| 4 | 2688 | 19.8% | |
| 5 | 596 | 4.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 1 | < 0.1% | |
| 8 | 8 | 0.1% | |
| 7 | 10 | 0.1% | |
| 6 | 67 | 0.5% | |
| 5 | 596 | 4.4% |
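Because Rooms has only nine distinct values, its descriptive statistics can be rebuilt from the frequency table alone. The sketch below does this for the sum, mean, variance, standard deviation and coefficient of variation. It assumes the report uses the sample variance (dividing by n - 1); the names are illustrative.

```python
import math

# Frequency table for Rooms, copied from the report: value -> count.
rooms_counts = {3: 5881, 2: 3648, 4: 2688, 1: 681, 5: 596, 6: 67, 7: 10, 8: 8, 10: 1}

n = sum(rooms_counts.values())
total = sum(value * count for value, count in rooms_counts.items())
mean = total / n

# Assumption: sample variance, i.e. squared deviations divided by n - 1.
sum_sq_dev = sum(count * (value - mean) ** 2 for value, count in rooms_counts.items())
variance = sum_sq_dev / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, total)
print(round(mean, 6), round(variance, 6), round(std, 6), round(cv, 6))
```

The count and sum come out as 13580 and 39898, matching the number of observations and the Sum row for Rooms.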
Type
Categorical
| Statistic | Value |
|---|---|
| Distinct | 3 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| h | 9449 | 69.6% | |
| u | 3017 | 22.2% | |
| t | 1114 | 8.2% |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Length
| Statistic | Value |
|---|---|
| Max length | 1 |
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Price
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 2204 |
| Distinct (%) | 16.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1075684.079 |
| Minimum | 85000 |
| Maximum | 9000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 85000 |
| 5-th percentile | 405000 |
| Q1 | 650000 |
| median | 903000 |
| Q3 | 1330000 |
| 95-th percentile | 2290050 |
| Maximum | 9000000 |
| Range | 8915000 |
| Interquartile range (IQR) | 680000 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 639310.7243 |
| Coefficient of variation (CV) | 0.5943294472 |
| Kurtosis | 9.874338886 |
| Mean | 1075684.079 |
| Median Absolute Deviation (MAD) | 313000 |
| Skewness | 2.239624313 |
| Sum | 1.46077898e+10 |
| Variance | 4.087182022e+11 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1100000 | 113 | 0.8% | |
| 1300000 | 109 | 0.8% | |
| 800000 | 109 | 0.8% | |
| 650000 | 109 | 0.8% | |
| 600000 | 104 | 0.8% | |
| 1000000 | 97 | 0.7% | |
| 1200000 | 97 | 0.7% | |
| 900000 | 95 | 0.7% | |
| 700000 | 91 | 0.7% | |
| 1400000 | 89 | 0.7% | |
| Other values (2194) | 12567 | 92.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 85000 | 1 | < 0.1% | |
| 131000 | 1 | < 0.1% | |
| 145000 | 2 | < 0.1% | |
| 160000 | 1 | < 0.1% | |
| 170000 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 9000000 | 1 | < 0.1% | |
| 8000000 | 1 | < 0.1% | |
| 7650000 | 1 | < 0.1% | |
| 6500000 | 1 | < 0.1% | |
| 6400000 | 1 | < 0.1% |
Method
Categorical
| Statistic | Value |
|---|---|
| Distinct | 5 |
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| S | 9022 | 66.4% | |
| SP | 1703 | 12.5% | |
| PI | 1564 | 11.5% | |
| VB | 1199 | 8.8% | |
| SA | 92 | 0.7% |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Length
| Statistic | Value |
|---|---|
| Max length | 2 |
| Median length | 1 |
| Mean length | 1.335640648 |
| Min length | 1 |
SellerG
Categorical
| Statistic | Value |
|---|---|
| Distinct | 268 |
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Nelson | 1565 | 11.5% | |
| Jellis | 1316 | 9.7% | |
| hockingstuart | 1167 | 8.6% | |
| Barry | 1011 | 7.4% | |
| Ray | 701 | 5.2% | |
| Marshall | 659 | 4.9% | |
| Buxton | 632 | 4.7% | |
| Biggin | 393 | 2.9% | |
| Brad | 342 | 2.5% | |
| Woodards | 301 | 2.2% | |
| Other values (258) | 5493 | 40.4% |
Unique
| Statistic | Value |
|---|---|
| Unique | 78 |
| Unique (%) | 0.6% |
Length
| Statistic | Value |
|---|---|
| Max length | 23 |
| Median length | 6 |
| Mean length | 6.402503682 |
| Min length | 1 |
Date
Categorical
| Statistic | Value |
|---|---|
| Distinct | 58 |
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| 27/05/2017 | 473 | 3.5% | |
| 3/06/2017 | 395 | 2.9% | |
| 12/08/2017 | 387 | 2.8% | |
| 17/06/2017 | 374 | 2.8% | |
| 27/11/2016 | 362 | 2.7% | |
| 29/07/2017 | 341 | 2.5% | |
| 4/03/2017 | 337 | 2.5% | |
| 25/02/2017 | 333 | 2.5% | |
| 24/06/2017 | 329 | 2.4% | |
| 10/12/2016 | 319 | 2.3% | |
| Other values (48) | 9930 | 73.1% |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Length
| Statistic | Value |
|---|---|
| Max length | 10 |
| Median length | 10 |
| Mean length | 9.724815906 |
| Min length | 9 |
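The Date column is profiled as text, which is why it is summarized by string length (9 or 10 characters) rather than by a time range. A minimal sketch of converting such values to real dates follows, assuming a day/month/year layout; values such as 27/05/2017 can only be read day-first. The sample list is copied from the frequency table and the variable names are illustrative.

```python
from datetime import datetime

# Sample values copied from the Date frequency table.
raw_dates = ["27/05/2017", "3/06/2017", "12/08/2017", "17/06/2017", "27/11/2016"]

# Assumption: day/month/year; %d also accepts a day without a leading zero.
parsed = [datetime.strptime(value, "%d/%m/%Y").date() for value in raw_dates]

for raw, date in zip(raw_dates, parsed):
    print(f"{raw:>10} -> {date.isoformat()}")
```

Once parsed, the values sort chronologically instead of alphabetically.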
Distance
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 202 |
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.13777614 |
| Minimum | 0 |
| Maximum | 48.1 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 2.6 |
| Q1 | 6.1 |
| median | 9.2 |
| Q3 | 13 |
| 95-th percentile | 20.6 |
| Maximum | 48.1 |
| Range | 48.1 |
| Interquartile range (IQR) | 6.9 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 5.868724943 |
| Coefficient of variation (CV) | 0.5788966792 |
| Kurtosis | 5.260001109 |
| Mean | 10.13777614 |
| Median Absolute Deviation (MAD) | 3.35 |
| Skewness | 1.676937083 |
| Sum | 137671 |
| Variance | 34.44193246 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 11.2 | 739 | 5.4% | |
| 9.2 | 367 | 2.7% | |
| 13.9 | 324 | 2.4% | |
| 7.8 | 306 | 2.3% | |
| 4.6 | 263 | 1.9% | |
| 13 | 252 | 1.9% | |
| 5.2 | 248 | 1.8% | |
| 8 | 248 | 1.8% | |
| 13.8 | 237 | 1.7% | |
| 2.6 | 235 | 1.7% | |
| Other values (192) | 10361 | 76.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 6 | < 0.1% | |
| 0.7 | 8 | 0.1% | |
| 1.2 | 33 | 0.2% | |
| 1.3 | 5 | < 0.1% | |
| 1.5 | 17 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 48.1 | 1 | < 0.1% | |
| 47.4 | 1 | < 0.1% | |
| 47.3 | 3 | < 0.1% | |
| 45.9 | 9 | 0.1% | |
| 45.2 | 1 | < 0.1% |
Postcode
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 198 |
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3105.301915 |
| Minimum | 3000 |
| Maximum | 3977 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 3000 |
| 5-th percentile | 3013 |
| Q1 | 3044 |
| median | 3084 |
| Q3 | 3148 |
| 95-th percentile | 3204 |
| Maximum | 3977 |
| Range | 977 |
| Interquartile range (IQR) | 104 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 90.67696409 |
| Coefficient of variation (CV) | 0.02920069178 |
| Kurtosis | 29.15686787 |
| Mean | 3105.301915 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | 4.076152215 |
| Sum | 42170000 |
| Variance | 8222.311816 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3073 | 359 | 2.6% | |
| 3020 | 306 | 2.3% | |
| 3121 | 292 | 2.2% | |
| 3040 | 290 | 2.1% | |
| 3046 | 284 | 2.1% | |
| 3165 | 249 | 1.8% | |
| 3058 | 246 | 1.8% | |
| 3163 | 245 | 1.8% | |
| 3012 | 242 | 1.8% | |
| 3072 | 239 | 1.8% | |
| Other values (188) | 10828 | 79.7% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3000 | 46 | 0.3% | |
| 3002 | 22 | 0.2% | |
| 3003 | 31 | 0.2% | |
| 3006 | 41 | 0.3% | |
| 3008 | 3 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3977 | 8 | 0.1% | |
| 3976 | 4 | < 0.1% | |
| 3910 | 6 | < 0.1% | |
| 3810 | 3 | < 0.1% | |
| 3809 | 1 | < 0.1% |
Bedroom2
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 12 |
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.914727541 |
| Minimum | 0 |
| Maximum | 20 |
| Zeros | 16 |
| Zeros (%) | 0.1% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.9659210617 |
| Coefficient of variation (CV) | 0.33139326 |
| Kurtosis | 8.074963808 |
| Mean | 2.914727541 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7740822106 |
| Sum | 39582 |
| Variance | 0.9330034975 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 5896 | 43.4% | |
| 2 | 3737 | 27.5% | |
| 4 | 2601 | 19.2% | |
| 1 | 691 | 5.1% | |
| 5 | 556 | 4.1% | |
| 6 | 63 | 0.5% | |
| 0 | 16 | 0.1% | |
| 7 | 10 | 0.1% | |
| 8 | 5 | < 0.1% | |
| 9 | 3 | < 0.1% | |
| Other values (2) | 2 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 16 | 0.1% | |
| 1 | 691 | 5.1% | |
| 2 | 3737 | 27.5% | |
| 3 | 5896 | 43.4% | |
| 4 | 2601 | 19.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 9 | 3 | < 0.1% | |
| 8 | 5 | < 0.1% | |
| 7 | 10 | 0.1% |
Bathroom
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 9 |
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.534241532 |
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 34 |
| Zeros (%) | 0.3% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.6917117225 |
| Coefficient of variation (CV) | 0.4508493012 |
| Kurtosis | 3.594973134 |
| Mean | 1.534241532 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.377405972 |
| Sum | 20835 |
| Variance | 0.478465107 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 7512 | 55.3% | |
| 2 | 4974 | 36.6% | |
| 3 | 917 | 6.8% | |
| 4 | 106 | 0.8% | |
| 0 | 34 | 0.3% | |
| 5 | 28 | 0.2% | |
| 6 | 5 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 7 | 2 | < 0.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 34 | 0.3% | |
| 1 | 7512 | 55.3% | |
| 2 | 4974 | 36.6% | |
| 3 | 917 | 6.8% | |
| 4 | 106 | 0.8% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 2 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 6 | 5 | < 0.1% | |
| 5 | 28 | 0.2% | |
| 4 | 106 | 0.8% |
Car
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 11 |
| Distinct (%) | 0.1% |
| Missing | 62 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.610075455 |
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 1026 |
| Zeros (%) | 7.6% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.9626335192 |
| Coefficient of variation (CV) | 0.5978809976 |
| Kurtosis | 5.193182788 |
| Mean | 1.610075455 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.369675926 |
| Sum | 21765 |
| Variance | 0.9266632924 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 5591 | 41.2% | |
| 1 | 5509 | 40.6% | |
| 0 | 1026 | 7.6% | |
| 3 | 748 | 5.5% | |
| 4 | 506 | 3.7% | |
| 5 | 63 | 0.5% | |
| 6 | 54 | 0.4% | |
| 8 | 9 | 0.1% | |
| 7 | 8 | 0.1% | |
| 10 | 3 | < 0.1% | |
| (Missing) | 62 | 0.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1026 | 7.6% | |
| 1 | 5509 | 40.6% | |
| 2 | 5591 | 41.2% | |
| 3 | 748 | 5.5% | |
| 4 | 506 | 3.7% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 3 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 8 | 9 | 0.1% | |
| 7 | 8 | 0.1% | |
| 6 | 54 | 0.4% |
Landsize
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 1448 |
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 558.4161267 |
| Minimum | 0 |
| Maximum | 433014 |
| Zeros | 1939 |
| Zeros (%) | 14.3% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 0 |
| Q1 | 177 |
| median | 440 |
| Q3 | 651 |
| 95-th percentile | 995 |
| Maximum | 433014 |
| Range | 433014 |
| Interquartile range (IQR) | 474 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 3990.669241 |
| Coefficient of variation (CV) | 7.146407581 |
| Kurtosis | 10180.34683 |
| Mean | 558.4161267 |
| Median Absolute Deviation (MAD) | 236 |
| Skewness | 95.23740045 |
| Sum | 7583291 |
| Variance | 15925440.99 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1939 | 14.3% | |
| 650 | 103 | 0.8% | |
| 697 | 71 | 0.5% | |
| 700 | 48 | 0.4% | |
| 585 | 47 | 0.3% | |
| 534 | 42 | 0.3% | |
| 590 | 39 | 0.3% | |
| 696 | 36 | 0.3% | |
| 649 | 36 | 0.3% | |
| 603 | 35 | 0.3% | |
| Other values (1438) | 11184 | 82.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1939 | 14.3% | |
| 1 | 2 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 433014 | 1 | < 0.1% | |
| 76000 | 1 | < 0.1% | |
| 75100 | 1 | < 0.1% | |
| 44500 | 1 | < 0.1% | |
| 41400 | 1 | < 0.1% |
BuildingArea
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 602 |
| Distinct (%) | 8.4% |
| Missing | 6450 |
| Missing (%) | 47.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 151.9676499 |
| Minimum | 0 |
| Maximum | 44515 |
| Zeros | 17 |
| Zeros (%) | 0.1% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0 |
| 5-th percentile | 51 |
| Q1 | 93 |
| median | 126 |
| Q3 | 174 |
| 95-th percentile | 294 |
| Maximum | 44515 |
| Range | 44515 |
| Interquartile range (IQR) | 81 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 541.0145376 |
| Coefficient of variation (CV) | 3.560063856 |
| Kurtosis | 6347.802222 |
| Mean | 151.9676499 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | 77.69154092 |
| Sum | 1083529.344 |
| Variance | 292696.7299 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 120 | 114 | 0.8% | |
| 110 | 89 | 0.7% | |
| 100 | 88 | 0.6% | |
| 130 | 84 | 0.6% | |
| 115 | 77 | 0.6% | |
| 150 | 74 | 0.5% | |
| 104 | 66 | 0.5% | |
| 90 | 65 | 0.5% | |
| 140 | 64 | 0.5% | |
| 112 | 63 | 0.5% | |
| Other values (592) | 6346 | 46.7% | |
| (Missing) | 6450 | 47.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 17 | 0.1% | |
| 1 | 11 | 0.1% | |
| 2 | 16 | 0.1% | |
| 3 | 20 | 0.1% | |
| 4 | 4 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 44515 | 1 | < 0.1% | |
| 6791 | 1 | < 0.1% | |
| 3558 | 1 | < 0.1% | |
| 3112 | 1 | < 0.1% | |
| 1561 | 1 | < 0.1% |
YearBuilt
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 144 |
| Distinct (%) | 1.8% |
| Missing | 5375 |
| Missing (%) | 39.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1964.684217 |
| Minimum | 1196 |
| Maximum | 2018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 1196 |
| 5-th percentile | 1900 |
| Q1 | 1940 |
| median | 1970 |
| Q3 | 1999 |
| 95-th percentile | 2012 |
| Maximum | 2018 |
| Range | 822 |
| Interquartile range (IQR) | 59 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 37.27376222 |
| Coefficient of variation (CV) | 0.01897188459 |
| Kurtosis | 21.22603222 |
| Mean | 1964.684217 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | -1.54127876 |
| Sum | 16120234 |
| Variance | 1389.33335 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1970 | 866 | 6.4% | |
| 1960 | 725 | 5.3% | |
| 1950 | 580 | 4.3% | |
| 1900 | 341 | 2.5% | |
| 1980 | 338 | 2.5% | |
| 2000 | 300 | 2.2% | |
| 1920 | 280 | 2.1% | |
| 1930 | 274 | 2.0% | |
| 1910 | 240 | 1.8% | |
| 1940 | 238 | 1.8% | |
| Other values (134) | 4023 | 29.6% | |
| (Missing) | 5375 | 39.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1196 | 1 | < 0.1% | |
| 1830 | 1 | < 0.1% | |
| 1850 | 4 | < 0.1% | |
| 1854 | 1 | < 0.1% | |
| 1856 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2018 | 1 | < 0.1% | |
| 2017 | 18 | 0.1% | |
| 2016 | 58 | 0.4% | |
| 2015 | 65 | 0.5% | |
| 2014 | 100 | 0.7% |
CouncilArea
Categorical
| Statistic | Value |
|---|---|
| Distinct | 33 |
| Distinct (%) | 0.3% |
| Missing | 1369 |
| Missing (%) | 10.1% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Moreland | 1163 | 8.6% | |
| Boroondara | 1160 | 8.5% | |
| Moonee Valley | 997 | 7.3% | |
| Darebin | 934 | 6.9% | |
| Glen Eira | 848 | 6.2% | |
| Stonnington | 719 | 5.3% | |
| Maribyrnong | 692 | 5.1% | |
| Yarra | 647 | 4.8% | |
| Port Phillip | 628 | 4.6% | |
| Banyule | 594 | 4.4% | |
| Other values (23) | 3829 | 28.2% | |
| (Missing) | 1369 | 10.1% |
Unique
| Statistic | Value |
|---|---|
| Unique | 2 |
| Unique (%) | < 0.1% |
Length
| Statistic | Value |
|---|---|
| Max length | 17 |
| Median length | 9 |
| Mean length | 8.457437408 |
| Min length | 3 |
Lattitude
Real number (ℝ)
| Statistic | Value |
|---|---|
| Distinct | 6503 |
| Distinct (%) | 47.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -37.80920273 |
| Minimum | -38.18255 |
| Maximum | -37.40853 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | -38.18255 |
| 5-th percentile | -37.9348 |
| Q1 | -37.8568225 |
| median | -37.802355 |
| Q3 | -37.7564 |
| 95-th percentile | -37.6989385 |
| Maximum | -37.40853 |
| Range | 0.77402 |
| Interquartile range (IQR) | 0.1004225 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.0792598226 |
| Coefficient of variation (CV) | -0.002096310339 |
| Kurtosis | 1.573252695 |
| Mean | -37.80920273 |
| Median Absolute Deviation (MAD) | 0.050455 |
| Skewness | -0.4266949343 |
| Sum | -513448.9731 |
| Variance | 0.006282119479 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -37.8361 | 21 | 0.2% | |
| -37.7969 | 16 | 0.1% | |
| -37.8424 | 16 | 0.1% | |
| -37.7609 | 14 | 0.1% | |
| -37.8161 | 13 | 0.1% | |
| -37.8414 | 13 | 0.1% | |
| -37.7679 | 13 | 0.1% | |
| -37.7634 | 13 | 0.1% | |
| -37.8573 | 13 | 0.1% | |
| -37.8198 | 13 | 0.1% | |
| Other values (6493) | 13435 | 98.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -38.18255 | 1 | < 0.1% | |
| -38.17488 | 1 | < 0.1% | |
| -38.16802 | 1 | < 0.1% | |
| -38.16762 | 1 | < 0.1% | |
| -38.16624 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -37.40853 | 1 | < 0.1% | |
| -37.45392 | 1 | < 0.1% | |
| -37.45709 | 1 | < 0.1% | |
| -37.48381 | 1 | < 0.1% | |
| -37.48701 | 1 | < 0.1% |
Longtitude
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 7063 |
| Distinct (%) | 52.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 144.9952162 |
| Minimum | 144.43181 |
| Maximum | 145.52635 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 144.43181 |
| 5-th percentile | 144.835785 |
| Q1 | 144.9296 |
| median | 145.0001 |
| Q3 | 145.058305 |
| 95-th percentile | 145.153631 |
| Maximum | 145.52635 |
| Range | 1.09454 |
| Interquartile range (IQR) | 0.128705 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.1039155614 |
| Coefficient of variation (CV) | 0.0007166826888 |
| Kurtosis | 1.758615585 |
| Mean | 144.9952162 |
| Median Absolute Deviation (MAD) | 0.063415 |
| Skewness | -0.2109908954 |
| Sum | 1969035.036 |
| Variance | 0.0107984439 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 144.9966 | 17 | 0.1% | |
| 145.0104 | 15 | 0.1% | |
| 144.985 | 14 | 0.1% | |
| 145.0001 | 13 | 0.1% | |
| 144.991 | 13 | 0.1% | |
| 145.0243 | 12 | 0.1% | |
| 145.021 | 12 | 0.1% | |
| 144.997 | 12 | 0.1% | |
| 145.0043 | 12 | 0.1% | |
| 145.0116 | 12 | 0.1% | |
| Other values (7053) | 13448 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 144.43181 | 1 | < 0.1% | |
| 144.48571 | 1 | < 0.1% | |
| 144.54237 | 1 | < 0.1% | |
| 144.54532 | 1 | < 0.1% | |
| 144.55106 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 145.52635 | 1 | < 0.1% | |
| 145.48273 | 1 | < 0.1% | |
| 145.47052 | 1 | < 0.1% | |
| 145.45376 | 1 | < 0.1% | |
| 145.4453 | 1 | < 0.1% |
Regionname
Categorical
| Statistic | Value |
|---|---|
| Distinct | 8 |
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 106.1 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| Southern Metropolitan | 4695 | 34.6% | |
| Northern Metropolitan | 3890 | 28.6% | |
| Western Metropolitan | 2948 | 21.7% | |
| Eastern Metropolitan | 1471 | 10.8% | |
| South-Eastern Metropolitan | 450 | 3.3% | |
| Eastern Victoria | 53 | 0.4% | |
| Northern Victoria | 41 | 0.3% | |
| Western Victoria | 32 | 0.2% |
Unique
| Statistic | Value |
|---|---|
| Unique | 0 |
| Unique (%) | 0.0% |
Length
| Statistic | Value |
|---|---|
| Max length | 26 |
| Median length | 21 |
| Mean length | 20.79690722 |
| Min length | 16 |
Propertycount
Real number (ℝ≥0)
| Statistic | Value |
|---|---|
| Distinct | 311 |
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7454.417378 |
| Minimum | 249 |
| Maximum | 21650 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 106.1 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 249 |
| 5-th percentile | 2185 |
| Q1 | 4380 |
| median | 6555 |
| Q3 | 10331 |
| 95-th percentile | 14949 |
| Maximum | 21650 |
| Range | 21401 |
| Interquartile range (IQR) | 5951 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 4378.581772 |
| Coefficient of variation (CV) | 0.5873808172 |
| Kurtosis | 1.217820011 |
| Mean | 7454.417378 |
| Median Absolute Deviation (MAD) | 2695.5 |
| Skewness | 1.069339349 |
| Sum | 101230988 |
| Variance | 19171978.33 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 21650 | 359 | 2.6% | |
| 8870 | 298 | 2.2% | |
| 14949 | 260 | 1.9% | |
| 10969 | 249 | 1.8% | |
| 14577 | 239 | 1.8% | |
| 11918 | 222 | 1.6% | |
| 9264 | 220 | 1.6% | |
| 14887 | 202 | 1.5% | |
| 10412 | 195 | 1.4% | |
| 11308 | 191 | 1.4% | |
| Other values (301) | 11145 | 82.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 249 | 1 | < 0.1% | |
| 389 | 6 | < 0.1% | |
| 394 | 2 | < 0.1% | |
| 438 | 7 | 0.1% | |
| 457 | 2 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 21650 | 359 | 2.6% | |
| 17496 | 46 | 0.3% | |
| 17384 | 3 | < 0.1% | |
| 17093 | 13 | 0.1% | |
| 17055 | 24 | 0.2% |
Pearson's r
Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
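As a minimal sketch of that calculation, the function below divides the covariance by the product of the standard deviations using only the standard library. The sample values are the Rooms and Bedroom2 columns of the ten first rows shown in this report, so the result illustrates the method and is not the correlation over the full dataset. The function name is illustrative.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/(n - 1) factors cancel, so the unnormalized sums are enough.
    return cov / math.sqrt(var_x * var_y)

# Rooms and Bedroom2 from the ten first rows of the dataset.
rooms = [2, 2, 3, 3, 4, 2, 3, 2, 1, 2]
bedroom2 = [2, 2, 3, 3, 3, 2, 4, 2, 1, 3]

r = pearson_r(rooms, bedroom2)
# Shifting and positively rescaling either variable leaves r unchanged.
r_rescaled = pearson_r([10 * v + 5 for v in rooms], [0.5 * v - 1 for v in bedroom2])
print(round(r, 4), round(r_rescaled, 4))
```

The two printed values agree, which is the location and scale invariance described above.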
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
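The sketch below follows that description: it ranks both variables, giving tied values the average of their positions, and then applies Pearson's formula to the ranks. The tie handling is an assumption of this sketch, and the helper names and toy data are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of equal values starting at sorted position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x cubed.
x = [1, 2, 3, 4, 5, 6]
y = [v ** 3 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

On this toy data ρ reaches its maximum while r stays below 1, because the relationship is monotonic but not linear.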
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
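The pair-counting definition translates directly into code. The sketch below implements exactly the formula stated here, which is usually called tau-a; library implementations commonly apply a tie correction (tau-b), which this sketch does not attempt. The function name and toy data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, without any tie correction."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 is a tie: counted in the total but in neither group.
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
# Six pairs in total: five concordant, one discordant (the swapped 3 and 2).
print(kendall_tau_a(x, y))
```

With five concordant and one discordant pair out of six, the result is (5 - 1) / 6, about 0.667.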
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Suburb | Address | Rooms | Type | Price | Method | SellerG | Date | Distance | Postcode | Bedroom2 | Bathroom | Car | Landsize | BuildingArea | YearBuilt | CouncilArea | Lattitude | Longtitude | Regionname | Propertycount |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Abbotsford | 85 Turner St | 2 | h | 1480000.0 | S | Biggin | 3/12/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 1.0 | 202.0 | NaN | NaN | Yarra | -37.7996 | 144.9984 | Northern Metropolitan | 4019.0 |
| 1 | Abbotsford | 25 Bloomburg St | 2 | h | 1035000.0 | S | Biggin | 4/02/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 0.0 | 156.0 | 79.0 | 1900.0 | Yarra | -37.8079 | 144.9934 | Northern Metropolitan | 4019.0 |
| 2 | Abbotsford | 5 Charles St | 3 | h | 1465000.0 | SP | Biggin | 4/03/2017 | 2.5 | 3067.0 | 3.0 | 2.0 | 0.0 | 134.0 | 150.0 | 1900.0 | Yarra | -37.8093 | 144.9944 | Northern Metropolitan | 4019.0 |
| 3 | Abbotsford | 40 Federation La | 3 | h | 850000.0 | PI | Biggin | 4/03/2017 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 94.0 | NaN | NaN | Yarra | -37.7969 | 144.9969 | Northern Metropolitan | 4019.0 |
| 4 | Abbotsford | 55a Park St | 4 | h | 1600000.0 | VB | Nelson | 4/06/2016 | 2.5 | 3067.0 | 3.0 | 1.0 | 2.0 | 120.0 | 142.0 | 2014.0 | Yarra | -37.8072 | 144.9941 | Northern Metropolitan | 4019.0 |
| 5 | Abbotsford | 129 Charles St | 2 | h | 941000.0 | S | Jellis | 7/05/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 0.0 | 181.0 | NaN | NaN | Yarra | -37.8041 | 144.9953 | Northern Metropolitan | 4019.0 |
| 6 | Abbotsford | 124 Yarra St | 3 | h | 1876000.0 | S | Nelson | 7/05/2016 | 2.5 | 3067.0 | 4.0 | 2.0 | 0.0 | 245.0 | 210.0 | 1910.0 | Yarra | -37.8024 | 144.9993 | Northern Metropolitan | 4019.0 |
| 7 | Abbotsford | 98 Charles St | 2 | h | 1636000.0 | S | Nelson | 8/10/2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 2.0 | 256.0 | 107.0 | 1890.0 | Yarra | -37.8060 | 144.9954 | Northern Metropolitan | 4019.0 |
| 8 | Abbotsford | 6/241 Nicholson St | 1 | u | 300000.0 | S | Biggin | 8/10/2016 | 2.5 | 3067.0 | 1.0 | 1.0 | 1.0 | 0.0 | NaN | NaN | Yarra | -37.8008 | 144.9973 | Northern Metropolitan | 4019.0 |
| 9 | Abbotsford | 10 Valiant St | 2 | h | 1097000.0 | S | Biggin | 8/10/2016 | 2.5 | 3067.0 | 3.0 | 1.0 | 2.0 | 220.0 | 75.0 | 1900.0 | Yarra | -37.8010 | 144.9989 | Northern Metropolitan | 4019.0 |
Last rows
| | Suburb | Address | Rooms | Type | Price | Method | SellerG | Date | Distance | Postcode | Bedroom2 | Bathroom | Car | Landsize | BuildingArea | YearBuilt | CouncilArea | Lattitude | Longtitude | Regionname | Propertycount |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 13570 | Wantirna South | 34 Fewster Dr | 3 | h | 970000.0 | S | Barry | 26/08/2017 | 14.7 | 3152.0 | 3.0 | 2.0 | 2.0 | 674.0 | NaN | NaN | NaN | -37.88360 | 145.22805 | Eastern Metropolitan | 7082.0 |
| 13571 | Wantirna South | 15 Mara Cl | 4 | h | 1330000.0 | S | Barry | 26/08/2017 | 14.7 | 3152.0 | 4.0 | 2.0 | 2.0 | 717.0 | 191.0 | 1980.0 | NaN | -37.86887 | 145.22116 | Eastern Metropolitan | 7082.0 |
| 13572 | Watsonia | 76 Kenmare St | 2 | h | 650000.0 | PI | Morrison | 26/08/2017 | 14.5 | 3087.0 | 2.0 | 1.0 | 1.0 | 210.0 | 79.0 | 2006.0 | NaN | -37.70657 | 145.07878 | Northern Metropolitan | 2329.0 |
| 13573 | Werribee | 5 Nuragi Ct | 4 | h | 635000.0 | S | hockingstuart | 26/08/2017 | 14.7 | 3030.0 | 4.0 | 2.0 | 1.0 | 662.0 | 172.0 | 1980.0 | NaN | -37.89327 | 144.64789 | Western Metropolitan | 16166.0 |
| 13574 | Westmeadows | 9 Black St | 3 | h | 582000.0 | S | Red | 26/08/2017 | 16.5 | 3049.0 | 3.0 | 2.0 | 2.0 | 256.0 | NaN | NaN | NaN | -37.67917 | 144.89390 | Northern Metropolitan | 2474.0 |
| 13575 | Wheelers Hill | 12 Strada Cr | 4 | h | 1245000.0 | S | Barry | 26/08/2017 | 16.7 | 3150.0 | 4.0 | 2.0 | 2.0 | 652.0 | NaN | 1981.0 | NaN | -37.90562 | 145.16761 | South-Eastern Metropolitan | 7392.0 |
| 13576 | Williamstown | 77 Merrett Dr | 3 | h | 1031000.0 | SP | Williams | 26/08/2017 | 6.8 | 3016.0 | 3.0 | 2.0 | 2.0 | 333.0 | 133.0 | 1995.0 | NaN | -37.85927 | 144.87904 | Western Metropolitan | 6380.0 |
| 13577 | Williamstown | 83 Power St | 3 | h | 1170000.0 | S | Raine | 26/08/2017 | 6.8 | 3016.0 | 3.0 | 2.0 | 4.0 | 436.0 | NaN | 1997.0 | NaN | -37.85274 | 144.88738 | Western Metropolitan | 6380.0 |
| 13578 | Williamstown | 96 Verdon St | 4 | h | 2500000.0 | PI | Sweeney | 26/08/2017 | 6.8 | 3016.0 | 4.0 | 1.0 | 5.0 | 866.0 | 157.0 | 1920.0 | NaN | -37.85908 | 144.89299 | Western Metropolitan | 6380.0 |
| 13579 | Yarraville | 6 Agnes St | 4 | h | 1285000.0 | SP | Village | 26/08/2017 | 6.3 | 3013.0 | 4.0 | 1.0 | 1.0 | 362.0 | 112.0 | 1920.0 | NaN | -37.81188 | 144.88449 | Western Metropolitan | 6543.0 |